Picture for Le Wang

Le Wang

Xi'an Jiaotong University

From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval

Add code
Apr 25, 2025
Viaarxiv icon

RSRNav: Reasoning Spatial Relationship for Image-Goal Navigation

Add code
Apr 25, 2025
Viaarxiv icon

Manipulating Multimodal Agents via Cross-Modal Prompt Injection

Add code
Apr 22, 2025
Viaarxiv icon

Moment Quantization for Video Temporal Grounding

Add code
Apr 03, 2025
Viaarxiv icon

CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Add code
Jan 21, 2025
Viaarxiv icon

Referencing Where to Focus: Improving VisualGrounding with Referential Query

Add code
Dec 26, 2024
Figure 1 for Referencing Where to Focus: Improving VisualGrounding with Referential Query
Figure 2 for Referencing Where to Focus: Improving VisualGrounding with Referential Query
Figure 3 for Referencing Where to Focus: Improving VisualGrounding with Referential Query
Figure 4 for Referencing Where to Focus: Improving VisualGrounding with Referential Query
Viaarxiv icon

Neural-Network-Enhanced Metalens Camera for High-Definition, Dynamic Imaging in the Long-Wave Infrared Spectrum

Add code
Nov 26, 2024
Viaarxiv icon

Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision

Add code
Nov 15, 2024
Figure 1 for Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision
Figure 2 for Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision
Figure 3 for Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision
Figure 4 for Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision
Viaarxiv icon

Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval

Add code
Sep 30, 2024
Figure 1 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 2 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 3 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 4 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Viaarxiv icon

PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation

Add code
Sep 08, 2024
Figure 1 for PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation
Figure 2 for PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation
Figure 3 for PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation
Figure 4 for PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation
Viaarxiv icon